Unsupervised Automatic Speech Recognition: A review

نویسندگان

چکیده

Automatic Speech Recognition (ASR) systems can be trained to achieve remarkable performance given large amounts of manually transcribed speech, but labeled data sets difficult or expensive acquire for all languages interest. In this paper, we review the research literature identify models and ideas that could lead fully unsupervised ASR, including sub-word word modeling, segmentation speech signal, mapping from segments text. The objective study is limitations what learned alone understand minimum requirements recognition. Identifying these would help optimize resources efforts in ASR development low-resource languages.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Speech Recognition System: A Review

Speech is the most prominent & primary mode of Communication among human beings. Now-a-days Speech also has potential of being important mode of interaction with computers. This paper gives an overview of Automatic Speech Recognition System, Classification of Speech Recognition System and also includes overview of the steps followed for developing the Speech Recognition System in stages. This p...

متن کامل

Automatic speech recognition and speech variability: A review

Major progress is being recorded regularly on both the technology and exploitation of automatic speech recognition (ASR) and spoken language systems. However, there are still technological barriers to flexible solutions and user satisfaction under some circumstances. This is related to several factors, such as the sensitivity to the environment (background noise), or the weak representation of ...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

Active and unsupervised learning for automatic speech recognition

State-of-the-art speech recognition systems are trained using human transcriptions of speech utterances. In this paper, we describe a method to combine active and unsupervised learning for automatic speech recognition (ASR). The goal is to minimize the human supervision for training acoustic and language models and to maximize the performance given the transcribed and untranscribed data. Active...

متن کامل

Plasticity in Systems for Automatic Speech Recognition: A Review

Although the topic ‘plasticity in speech perception’ is primarily concerned with the malleability of human speech perceptual behaviour, it may be illuminating to consider in parallel the degree to which current state-of-the-art ‘automatic speech recognition’ (ASR) systems also change their behaviour over time. This paper provides a review of the computational mechanisms underlying contemporary ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Speech Communication

سال: 2022

ISSN: ['1872-7182', '0167-6393']

DOI: https://doi.org/10.1016/j.specom.2022.02.005